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ABSTRACT 



An algorithm is stated for finding the simple cycles in a digraph 
which is believed to be superior to previous algorithms. The algorithm 
is stated in a way which lends itself to use on a digital computer. 
Suitable modifications are presented which allow the algorithm to be 
applied to coalesced graphs. Finally, the algorithm is compared to a 
previously used technique, and is shown to require fewer operations. 
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I. INTRODUCTION 



One of the important problems in the theory of graphs, especially in 
its applications to physical systems, is that of locating the simple cycles 
in a directed graph. The author conducted an extensive literature search 
(all American Mathematical Society Reviews, loEoE.E. Circuit Theory Re- 
views, and American Computing Machinery Reviews), and was unable to find 

any really efficient methods for solving the problem. One of the more 

\ 

efficient methods was the algorithm of Danielson [3], which uses matrix 
techniques and is designed for application to undirected graphs, although 
it lends itself to directed graphs if certain modifications are made. In 
comparison with the algorithm which we shall develope, it will be seen 
that it is itself rather inefficient. 

We will first give a number of fundamental concepts, with examples, 
in order to familiarize the reader with the basics involved, and in order 
to make the paper as nearly self contained as possible. In this regard, 
we have adapted considerable material from Busacker and Saaty [2). 

After a consideration of the basic theory, we develope an algorithm 
for locating the simple cycles in a directed graph. In doing so, we state 
some specific definitions which may not be found in the literature, but 
which are fundamental to our algorithm. After presenting the algorithm, 
we give an example to show how it works, and then we prove that the algori- 
thm does indeed do what it is supposed to do. 

Following the statement of the algorithm, we give modifications which 
allow the algorithm to be applied to coalesced graphs (graphs in which a 
subgraph is represented by a single vertex. We then give an example, 
which is taken from circuit tlieory, and show how the algorithm is applied. 
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II. FUNDAMENTAL CONCEPTS 



Let S be a set. Suppose s is an element of S, and that t is an ele- 
ment of S. By the symbol (s6ct) we mean the unordered pair s and t (un- 
ordered in the sense that (s6ct) and (t&s) denote the same element). We 
shall use the symbol (S&S) to denote the unordered prodv.ct of S with it- 
self. Now, let V be any set which is non-empty, and let E be a set dis- 
joint from V. Define a mapping g from E into (V&V) . An undirected graph 
is the triple (V,E,g). We denote such a graph by 

G = G(V,E,g), 

or, simply 

G = G(V,E) 

if the mapping is understood. We call the elements of V vertices , the 
elements of E edges , and g is called the incidence mapping for G. 

^ di^^ected graph is the triple 

G,= G(V,E,g*), 

where V and E are defined as above and where g* maps E into (VxV), the 
unusual Cartesian product of V with itself. 

We use the terminology of Harary [4] and call a directed graph a di - 
graph . As in the undirected case, we call the elements of V vertices and 
the elements of E edges. We shall denote both elements of V and elements 
of E by subscripted lower case letters. If g* is one-to-one, we shall 
sometimes denote an element e of E by the pair vertices for which 

g*(e)^ (Vj^,V 2 )‘ Note that to each digraph there corresponds an undirected 
graph formed from the digraph by removing the direction from all edges. 

A r aph 

H = 
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is said to be a subgraph of a graph G = G(V,E,g) if and only if the following 
conditions hold: 

(i) V, 

(ii) e, 

(iii) = g|^ , 



. denotes the restriction of g to the subset E„ of E. 
H ^ H 



where g 

If e is an edge in a graph G = G(V,E,g), and if 

g(e) = (v&w) 

\ 

(or (v,w) or (w,v) in the case of a digraph), then e is said to be in - 
cident to V and w. If the graph is directed, then an edge e such that 

g(e) = (v,w) 

is said to be incident into w and incident out of v. Alternately, v is 
referred to as the initial vertex of e and w is referred to as the final 
vertex of e. A vertex v in a digraph whose incident edges are incident 
out of v is called a source ; a vertex w whose incident edges are incident 
into w is called a sink > 

Geometrically, we represent an undirected graph by associating each 
vertex with a geometrical point and each edge with a curve segment. A 
digraph is represented in the same way, except that the curve segment is 
directed according to the order of the vertices corresponding to the edge. 
Figure 1(a) shows a geometrical representation of an undirected graph, 
while Figure 1(b) shows a digraph. 




Figure 1(a) an undirected graph 
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For the graph in Figure 1(a), 



E = 



g:e. 






(v^&Vs) , - (v^&v^) , 



65 ’ ■ 

For the graph in 1(b), 

V - ^v;,v',v',v^.v'} , 



(Vj&V^) 



g : ej - (vj,vj), - (vj,v^), - (vj.v^) 



E' = 

, I , 



65 - (v^.v^), - (v^jV^). 

In the sequel, we shall consider only digraphs. 

In a digraph, two edges are said to be consecutive provided the 
initial vertex of one is the final vertex of the other. Thus, two edges 



e^^ and e^ such that 



g(ej^) = (a,b) and gCe^) = (b,c) 



are consecutive edges. A sequence of distinct consecutive edges of the 
form 



P ^ ^2 ^ ^3 ^ ^ ^n- 1^ 



such that 



g( 6 j^) = (v^.v^), gCe^) = (^2*^3), g(e^) = (v3,v^),...,g(e^_2) 

is called a path . Note that no edges are repeated in a path, however, 

vertices may be repeated. A subsequence P* of consecutive edges is 

called a subpath of the path P. If P* starts with the same edge e^ as 

P, we say P* is an initial subpath of P. We shall say a vertex v^ is 

attainable from a vertex v provided there is a path from v to v.. 
o ^ 01 
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Alternately, we say that v, is a descendant of v and that v is a pre- 

i o o 

decessor of 

We define a function d on the set ^ of all paths in a graph G in- 
to the set of natural numbers (including zero) by 



d(P) = n, P in , 

where n is the number of edges in the path P, d(P) is referred to as 
the length of the path P. Note that an edge is a path of length one. 

Let G = G(V,E,g) be a digraph. Define a multi-valued function f 
on V into V according to the following rule: 

f(v) = V* in V and there exists e in E such that g(e) = 

(v,v*)^ , for all V in V. We call f the descendant function of the 
vertex v. We define powers of f as follows: 



^(v) = [v 






V* in V, V* in 



D . \ , V in V, 



where is the set of all descendants of v by a path of length j. 
example, in Figure 1(b), we have 



For 



f(v{) = v'^ , f"^(vp = ^v|,vpv^] . 

A special type of path in a digraph is one which starts at a vertex 

V and ends at a vertex v , where v = v . Such a path is called a cycle 
o non 

If no vertex (except v^) is used more than once in a cycle, then the 
cycle is said to be simple . A simple cycle of length one is called a 
loop . A graph which contains no cycles is referred to as acyclic . 

Suppose G = G(V,E,g) is an acyclic graph in which all vertices are 
attainable from a single vertex v^ in V. Further, suppose no vertex has 
more than one edge incident into it. Then G is called a rooted ditree . 
v^ is called the root of G. Figure 2 below shows a rooted ditree 
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Note that a rooted ditree must have only one root. A vertex in a rooted 
ditree is said to be terminal if it has no descendants. Thus, in the 
graph of Figure 2, a^ is a root and terminal 

vertices. A branch in a rooted ditree is any path whose initial vertex 
is the root and whose final vertex is terminal. Hence, the sequence of 
edges 

B “ ( > ^2^ * ^^2 * ^5^ ^^5 ^ ^ 10^ ^ 

(note that since there are no multiple edges between any pair of vertices, 
there is no confusion if we represent an edge by its associated pair of 
vertices) is a branch of the ditree in Figure 2. 

A graph G = G(V,E,g) whose vertex set V can be partitioned into two 
non-null disjoint sets and such that E consists only of edges e^ 

such that 



g(e^) = or (vj,v^), in in V^, 

is called a bipatite directed graph or bi-digraph . The graph in Figure 
3 is a bi-digraph. 



Note that 
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THEOREM I ; If G is a bi-digraph, then all simple cycles in G are of 
even length* 

Proof ; Suppose C is a simple cycle in G of the form 

C “* (e-,ert> ***>^rt 

i z n-i n 

where d (C) is odd. If the length of C is odd, then the initial 
vertex of C must be different from the final vertex by the definition 
of bi-partite graph* But this is impossible since the initial and final 
vertex of a cycle must coincide* Therefore, d (C) is even. 
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III. AN ALGORITHM FOR FINDING THE SIMPLE CYCLES IN A DIGRAPH 



An important problem, especially in the application of graph techni- 
ques, is that of locating simple cycles in a digraph. In dealing v;ith 
large graphs (large in the sense that V and E contain many elements), 
it is useful to have an efficient algorithm which conserves memory space 
of a computer. The algorithm we shall develope in this chapter is de- 
signed to meet these requirements. In the final chapter of the paper, we 
will show how our algorithm compares with an algorithm given by Denielson 
(3]. 

Before starting, it will be necessary to introduce some new terminol- 
ogy. In the sequel, we shall denote the vertices of all graphs under 
consideration by integer-indexed lower case letters such that no two 
vertices have the same index. If j is an integer such that v^ occurs in 
a graph under consideration, we will say v^ is the vertex corresponding 
to the indexing integer j. 

DEFINITION 1 . 

Let G = G(V,E,g) be a digraph. Let v^ be an element of V. Then, 
the indexing integer j corresponding to v^ is said to be an adjacent 
element of v^ provided v^ belongs to f(v^^), where f is the descendant 
function of v^. 

For example, in the graph in Figure 3 above, the adjacent elements 
of v^ are the integers 6 and 9. 

DEFINITION 2 . 

If G = G(V,E,g) is a digraph, and if v^ is a vertex in V, then the 
adjacency set of v^ is the set of all adjacent elements of v^ (e.g., the 
adjacency set of v^ is the set of index-integers corresponding to the 
elements of f(v^)). 
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For example, in the graph in Figure 3, the adjacency set of a^ is 
the set ^ 6 , 9 } . If no vertices are adjacent to a vertex (i.e., if a ver- 
tex is a sink) we denote its adjacency set by 0 (the null set). 

In the discussion which follows, we assume that the graph under con- 
sideration, G = G(V,E,g) is a digraph on n vertices which are represented 
by the lower case letter v indexed 1 through n. We further assume that 
in order to communicate to the computer the structure of G, we have a 
list which associates each vertex in V with its adjacency set. 



DEFINITION 3. 



An attainability chain of a vertex v^ in a digraph G is a directed 
graph C^, which corresponds to a simple path P(v^,Vj) in G. The source 
vertex of is i (corresponding to v^) , and the sink vertex is either a 
sink vertex in G, or a copy k* of the first repeating vertex v^^ in the 
path. 



For example, consider the graph in Figure 3. The vertex a^^ has the 
following attainability chains: 




Figure 4. Attainability chains for the vertex a^ of the graph in Figure 3. 
Note that if a simple cycle occurs, as in the cycle composed of the ver- 
tices a^^, a^, a^, a^, then the attainability chain of any of these 

vertices terminates as soon as the first repetition of a vertex occurs. 
Also, note that each attainability chain is a rooted ditree with one 
branch. 
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DEFINITION 4. 



Tlie attainability tree for a vertex is a rooted ditree wirh a 
vertex corresponding to i in the root position and with branches con- 
sisting of all the attainability chains of v^. We will super-impose the 
initial subpaths of any attainability chains with identical initial sub- 
paths. 

The graph in Figure 5 shows the attainability tree for the vertex a^^ 
of Figure 3. 




We now give the formal statement of the algorithm, keeping in mind 
that the graph under consideration is a digraph G = G(V,E,g) on n ver- 
tices, and that we have a list of vertices and their corresponding 
attainability sets. 

STEP 1 . Inspect the list and eliminate all vertices whose index 
integers do not appear as adjacent elements. Obviously, these 
vertices could not belong to simple cycles because they have no 
edges incident into them (i.e., they are sources). Also, inspect 
the list and eliminate as roots all vertices whose adjacency set is 
0. Again, it is clear that such vertices cannot belong to any 
simple cycles because there are no edges incident out of them 
(i.e., they are sinks). We have thus eliminated all sources and 
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sinks as possible roots. We may now repeat the procedure on the 
remaining vertices until no more eliminations are possible. 

STEP 2 . Pick any vertex v^ from the r*st which has not been eli- 
minated and construct its attainability tree, consulting the 
origional list for incidence relations. Recall that we super- 
impose identical initial subpaths. 

STEP 3 . If an element is repeated along any attainability chain, 
record that element together with the elements between the two 
occurrence (in the order they appear). The vertices corresponding 
to these elements will compose a simple cycle in G. Note that 
repeated vertices must occur along the same branch of the at- 
tainability tree for a cycle to be located. 

STEP 4 . Eliminate as possible roots all elements which appeared in 
the attainability tree of v^, for if these vertices belonged to 
simple cycles, this would have been discovered in the attainability 
tree of v^. 

STEP 5 . If all vertices have been eliminated as possible roots, 
the algorithm terminates. If some vertices remain, choose any such 
vertex v^^ not eliminated as a possible root, construct its attain- 
ability tree and return to step 3. 

We now ask the following question: Does the algorithm locate all 

the simple cycles in a digraph (and only the simple cycles)? We answer 
this question with the following theorem: 
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THEOREM II , The algorithm outlined in Steps 1 through 5 above locates all 



of the simple cycles of a digraph, and it locates only simple cycles. 

Proof : First, note that no cycles will be located which are not 

simple, since a branch is terminated at the first repetition of a vertex. 

Now, suppose there is a simple cycle C in the digraph G which was 
not located by the algorithm. Then C can have no edges in common with 
any attainability tree considered in the sequence, for if it did, then 
by the fact that a branch is terminated only if a source, a sink, or a 
repetition occurs, C would have been located. Conversely, only those 
edges of G which involve sources or sinks in G are left out of the set 
of attainability trees. Tlius, we have established that if C is not 
located by the algorithm, then C does not belong to G, which is a contra- 
diction. Therefore, C must have been located by the algorithm. 

It should be noted at this point that some cycles may be located 
more than once. However, this will occur only in special cases. 

As an example of the application of this algorithm, consider the 
digraph shown in Figure 6 below. 
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Wg have the following list of vertices with their adjacency sets: 




Applying Step 1, we eliminate as a possible root, since its adjacency 
set is 0. We also eliminate v^, v^, v^, and v^, since 1, 2, 3, and 4 
do not occur in any of the adjacency sets. Let us now construct the 
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We have located one simple cycle, namely 



Note that 5, 6, 7, 8, 9, 10, 11, 12, 13 all appeared in the attainability 
tree of and that 1, 2, 3, 4, and 14 have already been eliminated 

as roots. Hence, by Step 4, we need construct no more attainability 
trees, and we can conclude that C is the only simple cycle in the graph. 
An inspection of Figure 6 also leads to this conclusion. 

Note that if the graph under consideration were a bipartite digraph 
(or bi-digraph), we would have had two lists instead of one. In this 
case, we would add the following step the simplifications above: 

STEP 0 : Eliminate as possible roots all vertices which occur in the 

list with the most vertices. This will be sufficient to locate all 
simple cycles by Theorem I which states that all cycles in a bi- 
digrapl have even length, and hence must contain at least one vertex 
from each list. This simplification will be used in the next 
chapter. 

We observe at this point, that though the primary purpose of our 
algorithm as stated is to locate simple cycles, it may also be used to 
locate any simple path in a digraph. For example, if we wish to know 
if there is a simple path between a vertr^x v^ and a vertex v^ in a di- 
graph, we need only construct the attainability tree of v^ and note 

whether it contains v,. 

1 
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IV. COALESCED GRAPHS 



The purpose of the foregoing algorithm is to make possible the loca- 
tion of all simple cycles in a digraph with a large number of vertices. 

In practice, in the analysis of very large graphs, such as the graph of 
the circuitry of a digital computer, a procedure is used which partitions 
the graph into subgraphs which are, in turn, partitioned into smaller sub- 
graphs, until manageable subgraphs are obtained. Once these subgraphs 
have been analyzed for cyclic structure, they are generally coalesced 
into a single vertex, and the resulting graph is then analyzed for its 
cyclic structure. In this chapter, we will show how the technique of 
Chapter II may be applied to such ‘‘coalesced graphs. “ 

One of the principal problems in locating the simple cycles in a 
coalesced graph is that a vertex which is a coalesced subgraph may be 
attainable from a vertex v, but a vertex v^ which is attainable from 
the coalesced subgraph may not be attainable from v in the original 
graph. For example, suppose G is the graph shown in Figure 7. 




y 



I 



e 




K 




•- 

b 



c 






Figure 7. A Digraph 
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Suppose that, by some procedure, we determine that the subgraph of G 
composed of the vertices a, b, c, d, e, and f, together with the edges 
connecting them, can be coalesced into a single vertex v^. Then, the 
graph shown in Figure 8 is the coalesced version of the graph in Figure 

7. 








Figure 8. The Coalesced Version of the Graph in Figure 7 
Note that this graph is bipartite, since one node actually represents a 
subgraph, while the others represent single vertices in the original 
graph. Also, note that in the graph in Figure 8, v* is a descendant of 
V but, in the original graph (Figure 7), it is not. Thus, we see that 
our algorithm cannot be applied to coalesced graphs without some modi- 
fication. 

Let G = G (V ,E ,g ) be a digraph without coalesced vertices, 
o o o o’^o ^ ^ 

Let S be a subgraph of G . Form the graph G = G (V ,E ,g ) in which 

o o coo o 

S has been coalesced into a single vertex. s. Let f be the descendent 

o 

function on the vertices of G , and let f be the descendent function on 

o c 

the vertices of G . Define subsets I(S) and J(S) on the vertex set V 

of G as follows: 
c 

I(S) = ^v^ in V^: s is in 

J(S) = £vj in V^; v^ is in 
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Now, define a mapping p on I(S) x J(S) into 0,1 such that 



P(ViV^) = 



0 if V. is not in f^(v.) for all k, 
j o i 



1 if there exists k such that v, is in f (v.). 

V. j o' i' 

Compute p(v^,v^) for all pairs with in I(S) and in J(S) 

as follows: Let s. be in S such that s. is also in f (v.). Let s. be 

1 1 o i j 

such that V. is in f (s.). Note that s^ and s^ must exist since v. is 

J o' j' i j I 

in I(S) and v. is in J(S). Let be the attainability tree of s^, 

i 

and suppose V is the vertex set of T . Then, if s . is in V , 

s . s , J s . 

1 1 1 



P(Vi,v^) = 1, 

otherwise, 

p(v.v .) =0. 

1 J 

Note that may not exist, in which case p(v^Vj) is obviously zero. 

When p(v^,v^) has been computed for all ordered pairs with 

v^ in I(S) and v^ in J(S), we apply the algorithm of Chapter II to the 

bi-digraph G^. Suppose we do this and obtain simple cycles C^^, ... , C^, 

Note that by Theorem I, the length of C^ (i = 1, ... , n) will be even. 

Hence, if m, is the number of vertices in C. , then m./2 is the number of 
i 11 

coalesced subgraphs in C^. Denote these coalesced subgraphs by 

... , k. 



^im^/2- 



Let 



J(k 



J-J 



V 


in 


V 


: k 


t m 


c 




V. 


in 


V 


: V 


/ i 


c 





ij 



is in f (v 






f 1, ••• , Tiy ^ 1, •••,m,/2 



, is in f 
1 c 



, i — 1, ... , nj j 1, ...,m^/2 

For each k^^ (i = 1, ..., n; j = 1, ..., ra^/2) form the ordered pair 

(tj_j^,t^^j^) , where is in I(k^j) and is in J(k^j) and both tj 

and t . , , are in C, . Now, the ordered pair (t, ,, t.,,) must be in the 
J+i i j”i j+1 
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domain of p, since is in x J(k^j). Note the value of 

p(t. zero for any j (j = 1, ... , m^/2), then is 

not actually a cycle in G^. If it is one for all j, then is indeed a 

cycle in G . For example, suppose G is the graph in Figure 9 below. 

O o 

\/^ O 

N^. 




Figure 9. A Digraph 




This is a digraph of an adder from a digital 



computer* Suppose we have 



a procedure which partitions into the four subgraphs shown in Figure 
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We now apply our algorithm to the four subgraphs. We have the following 
lists of vertices together with their adjacency sets; 



Subgraph A 


Su^ 


raph B 


Subf»] 


raph C 


Subgraph D 


^^2 •• 


[21] 


'^l ' 


{27} 


''15* 


£53,54,551 


^^8 • 


I25} 




[21] 


'^8 ^ 


{24} 


'^16- 


{53,54) 




{25} 




£22} 




[24} 


^7 = 


{66,68} 


''11* 


[79} 




[22] 


'^10* 


[34} 


'^18* 


{54} 


'^12* 


{79} 




[23] 


'^27 = 


{23} 


V19: 


{33] 




£8o] 




[23} 


''24* 


[36.37] 


'^20* 


{56} 


''14- 


[so] 


'^21* 


{29} 


'^28* 


{32} 


'^53' 


{59,653 


'^25' 


538,39] 


''22* 


[29] 


^36' 


{32,33] 


'^54' 


{60] 


''79’ 


{26} 


'^23* 


{29} 


'^35* 


{33} 


^5- 


{60] 


''80* 


be] 


'^29* 


{30} 


''40* 


{33} 


'' 56 ' 


\eol 


''35* 


[76} 


'^30* 


{35} 


''37* 


{34} 


^63' 


{66,673 


''38' 


[76,77} 


V35; 


0 


'^32* 


{41,42} 


V59: 


{68} 


''39* 


00 

0 ^ 


''63* 




'^33* 


[41,43} 


'"60* 


566,67,68,69^ 


''26* 


[31] 


'^71' 


[74 


'"34* 


[41,44} 


''65* 


[67,69} 


''31* 




'^72^ 


^?4 


''42* 


[30} 


'^64* 


{68,69} 


''40* 


[77} 


'^70* 


i73) 


'^43* 


[50} 


"'66' 


{75} 


''10* 


{78} 


'^73* 


^74] 


''44* 


{30} 


'^ 67 ’ 


[753 


''76* 


{45,46} 


''74' 


£-} 


''50' 


£31} 


'^68* 


{7i3 


''77* 


{45,473 


''75^ 


0 


\l- 


[-] 


'^69* 


{72] 


''78* 


[45,48] 


''61- 


{si] 


\ 5 ’ 


[37} 


'^75* 


0 


''46' 


{52} 


''62' 


t«] 


'^Sl* 


{34,5s} 


'^71* 


0 


''47' 


£52! 


'^81'* 


{64} 


'^58’ 


{62} 


'^72* 


0 


''48* 


{52} 


''64' 


0 


^7- 


{61,63} 






''49* 


{78} 






'"61. 


"62 ,''63- ® 






''52^ 


{49} 














''45 = 


0 
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We have the following attainability trees for the subgraphs A through D: 








41* 



tS-KO 






-40 



ai — ^ 



CYCUe 






UNO V. no C'tfCttt ^ 

ATTKm AftJc-vnv Pod u 



£mo 




i::i: 

^ “iSo* 



€^t«0 

,63 



£N0 






EMI> <iMt> 

Arr<»^K>4 feiUTN poA C 

CVtK:(78,‘W,St,<,‘») o9«>a3^ 
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»a4 Atc 

fiMO £^0 



41 4> 



1 



|0 



ICLd 



_ 75 7 ? 

WTT^^^i^fl\U^T7 pod W , two 

Figure 11. Attainability Trees for the Subgraphs of Figure 10. 
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We have located two simple cycles: 







' <'' 50 ’'' 51 ’'' 34 >'' 44 •^ 0 > 

^2 ‘ *'’ 78 ’'' 48 >'' 52 ’'’ 49 ’'' 78 * 

We now coalesce the four subgraphs into vertices and examine the re- 
sulting bi-digraph for cyclic structure in order to discover if there are 
any larger simple cycles in G . Figure 12 shows the coalesced graph G . 






Figure 12. The Coalesced Graph of the Graph in Figure 9. 



Since this is a bipartite graph, we have two lists of vertices: 



A: £35,64,75} 

B: £41,62,61,631 

C: £75,71,72} 

D: £40,45] 
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We now invoke step 0 and consider as possible roots only those vertices 
which appear in list V^. We construct the attainability tree for vertex 
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We see that vertices B, C, and D, appear in the attainability tree of A, 
so it is unnecessary to construct any more attainability trees (by Step 
4). 

We have located the following simple cycles in G : 

c 

c, 

c, 

Cc 

c. 

J 

Cc 

S 

We must now compute the values of p to see which of these simple cycles 
in is also a simple cycle in G^. We shall do this only for Cj^. The 
others would be done exactly the same. 

We first note that involves the coalesced vertices A and B, with 
A in )> 35 ® ^c^'^ 35^’ '^62 Hence, 
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= (a,V35 
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“(A.Vjj 


.B.v^^.A) 
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-(A.Vjj 
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=(A,Vj5 


,B,v^3,C,v^j,A) 




=(A,»35 
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-(A,V35 
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-(A.V 33 




'9 


=(A,V33 
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=(A.V33 


,D,v^0»^«''63’^'''72’^) 
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=(A.V33 
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KA.V 35 
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=<*.''64 


,C,v^2»^^ 
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we must compute P(Vg2’'^35^ compute P(Vg2*'^35^’ 

examine subgraph A, and note that 62 does not appear in any of the at- 
tainability trees of the vertices of A. Hence, we must construct the 

attainability tree of to see if v__ is a descendant of v,„: 

oz bz 




Figure 14. The Attainability Tree of 

We see that 35 does not appear in the attainability tree of v,,-and thus, 

o2 

p(v,_,v„_) = 0. Therefore, C, is not a cycle in G . We will not repeat 
bz i o 

the procedure for C 2 through however, if we did, we would find that 

none of these are cycles in G • 

In actual applications of this technique, we would first determine 
the cyclic stxnicture of each subgraph which is coalesced into a node. 

In addition to retaining the resulting information about cycles, we 
would also note p(v^,Vj) for each pair of nodes external to the coalesced 
subgraphs. We would then use this information in constructing the attain- 
ability trees for the coalesced graph. Had we done this above, the at- 
tainability tree in Figure 13 would have been less complicated. We did 
not follc\7 this procedure above in order to better illustrate the use of 
the function p. 
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V. A COMPARISON V/ITH ANO^HIER ALGOTHITHM 



We shall now show how the algorithm developed in Chapter II for 
locating the simple cycles of a digraph is much superior, in terms of 
operations required, to another algorithm used for the same purpose. 

Danielson [3] developed an algorithm for locating all paths (in- 
cluding cycles) in an undirected graph using matrix techniques. His 
method, while designed for undirected graphs, can easily be modified 
for application to digraphs. We now state the necessary definitions to 
explain his algorithm. 

DEFINITION 5 . 

If G - G(V,E,g) is an undirected graph on n vertices (v, , ... , v ) 

1 n 

and m edges (e^, * then the adjacency matrix A^ of G is given by 

A = (a . , where 
® ^ j p 

1 if an edge connects v. to v,, 

^ 0 otherwise. 

V* 



DEFINITION 6. 



If G = G(V,E,g) 
and m edges (e^^, . . . 
is given by A = (®j| j 



is an undirected graph on n vertices (v^^, ... , v^) 
, e^) , then the variable adjacency matrix A of G 



) , whe re 



.■(V 



f edge ej^ joins v^ to v^ , 
otherwis- . 



DEFINITION 7 . 

If G = G(V,E,g) is an undirected graph on n vertices (v , ... , v ), 

1 n 

then the modified variable adjacency matrix B of G is given by B = (b, .), 
where 
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To illustrate these concepts. 
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consider 



j If an edge joins 
otherwise 

the graph of Figure 15, 



to V , 




Figure 15. An Undirected Graph 
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The algorithm states that all edge progressions of length k between 
and Vj will appear in the i-j position of Pj^, where is defined re- 
crusively as follows: 

P2 . BA_^, . BP^,^. 
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Hence, for the graph in Figure 15, 




V V +VJV +VJV 



12 $2 5 3 




V2V3 




0 



V V V 



r2 ''14 2 u 3 




V, V„+V, V 



li 2 h 3 • 
Vi 

V 4V2V^ 



We first note that, as stated before, the algorithm only locates 
the paths and cycles in an undirected graph. However, it is possible to 
modify the procedure slightly to obtain the cycles of a directed graph. 

How this is done is unimportant; what is important is that to locate all 
paths (including cycles) of lengths, k, k-1, ... , 2 in a digraph, we 
are required to multiply n x n matrices k times. In particular, if we 
were seeking all simple cycles in a complicated graph on n vertices, 
we would have no choice but to perform matrix multiplication n times. 

Now, in using this algorithm, we are not performing matrix multiplication 
in the usual manner, since the elements of the of B symbols 

(i.e., subscripted lower case letters) rather than numbers. Hence, what 
we are really doing is concatenating symbols in order to obtain the ele- 
ments of the Note that the symbol + is included in this concaten- 

ation each time strings of symbols are added together. For example, the 
element of above which appears in the second row and fourth column 
(ViV 2 +v^V 2 +v^Vj) is the concatenation of eight symbols. interpret 

this string of symbols as follows: there are three paths of length three 
from vertex v^ to vertex v, , namely via vertices v, and v^ or via vertices 
v^ and V 2 or via v^ and v^* We observe that the basic operation of 
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Danielson’s algorithm is the writing down of a single symbol. That is, 
in writing the element mentioned above, eight basic operations were in- 
volved. This process takes up more time and memory space than that re- 
quired to perform a comparison of two integers which is the basic operation 
of our algorithm. 

As was mentioned earlier, we are required to perform n nxn matrix 
multiplications in order to analyze an arbitrary graph for cyclic struc- 

3 

ture using Danielson’s algorithm. Hence, we must generate n elements, 
each of which may be a long string of symbols. Though some of these ele- 
ments may be zero, at worst, as in the case where each pair of vertices 
is joined by an edge and a loop occurs at each vertex, all elements would 

be non-zero. Now, in the event we were to analyze such a graph, each 

2 k-1 

element of would contain n terms of (k-l)n symbols for the nodes 

k-1 2 

and n - 1 plus (+) signs. Altogether, each of the n terms of P 

lx 

would require 

(k-l)n + n -1 = kn - 1 

symbols. Hence, if the basic operation is that of writing down a single 
symbol, then the number of operations required to locate all paths (and 
thus, cycles) in the worst case is given by 

D = - l) . 

k=2 



To obtain an upper bound for the number of basic operations (i.e., 
comparison of integers) using our algorithm, we consider a special type 
of graph, namely the complete symmetric graph on n vertices. A digraph 
G = G(V,E,g) is said to be complete if for every pair of vertices v^ and 
Vj in V, there exists an edge e in E such that either g(e) = (v^^,Vj), or 
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g(e) = ^ digraph G = G(V,E,g) is said to be symmetric if 

for every edge e such that g(e) = there 

exists an edge e* such that g(e‘) = Thus* a complete sym- 

metric graph on 4 vertices is shown in Figure 16 below: 




Figure 16. A Complete Symmetric Graph on 4 Vertices 
We will compute the number of comparisons ^ necessary to analyze this 
graph for cycles. Clearly this will'be an upper bound on the number of 
comparisons required to analyze any digraph without parallel edges, since 
any other digraph with these properties will be a subgraph of the complete 
symmetric graph on the same number of vertices. 

Note that the adjacency set of each vertex in a complete symmetric 
graph will contain n elements. Hence, at the first stage of the attain- 
ability tree (i.e., at the stage involving all vertices attainable by a 
path of length one from the root vertex), we have n elements which must 

be compared with one element, namely the root vertex. At this stage, 

/ 

we will locate one simple cycle: the loop which occurs at the root vertex. 

Thus, we continue on to the second stage with n(n-l) vertices. But, at 
this second stage, we will locate all of the other n-1 loops and all of 
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the simple cycles of length two which involve the root vertex. There 
are n-1 of these. Therefore, we continue on to the third stage with 
(n(n-l) - (n-1) - (n-1)) n = (n-1) (n-2)n vertices. Now, at this stage, 
we locate (n-1) (n-2) loops (obviously some are duplications), (n-1) (n-2) 
cycles of length two, and (n-1) (n-2) cycles of length three. Thus, we 
continue on to the fourth stage with 

(n(n-l)(n-2) - (n-1) (n-2) - (n-1) (n-2) - (n-l)(n-2))n 
* (n(n-l)(n-2) - 3(n-l) (n-2))n = (n-1) (n-2) (n-3)n 
vertices. At the kth stage, we will have (n-1) (n-2) ... (n-k+l)n vertices. 
Now, each of these must be compared with k vertices. Hence, at the kth 
stage, there are kn(n-l)(n-2) ... (n-k+1) comparisons to make. We demon- 
strate this more formally in the following theorem. 

THEOREM III . For a complete symmetric digraph (with loops at each vertex) 
on n vertices, there will be (n-1) (n-2) ... (n-k+l)n vertices at the kth 
stage of the attainability tree. 

Proof : The proof is by induction on k. Suppose k is 1. Then the 

number of vertices is n, which is clearly the case, since in the complete 
symmetric graph on n vertices, each element has n terms in its adjacency 
set. 

Now, suppose the conclusion holds for k = p. That is, suppose that 
at the pth stage, there are (n-1) (n-2) ...(n»p+l)n vertices. At this stage, 
we will locate (n-1) (n-2) ... (n-p+1) loops, a like number of cycles of 
lengths 2, 3, ... , p. Hence, at the p-ll st stage, we will have 

((n(n-l) (n-2) ... (n-p+1) - p(n-l)(n-2) ... (n-p+l))n 
= n(n-l)(n-2) ... (n-p+1) (n-p) 

vertices. Thus, since p was arbitrary, the conclusion holds in general. 



35 



We see, then, that an upper bound on the number of comparisons re- 
quired to analyze an n-vertex digraph is given by 

n 

C = kn(n-l) I /(n-k) ! 

k=l 

k- 1 

Now, since n is clearly greater than (n-1) J /(n-k) J for k greater than 
one, C will be less than D (for n'2r3). In fact, as n becomes large, 
the difference between D and C becomes very great indeed. Also, since 
the complete symmetric graph occurs rarely in practice the number of 
comparisons will generally be very much less than the upper bound C (e.g., 
the graph in Figure 6 which required 79 comparisons as opposed to several 
million for the complete symmetric graph (with loops) on 14 vertices). 
However, since the non-zero elements of will generally disappear 
quickly from as k increases, the number of basic operations required 
to carry out Danielson*s algorithm will often approach D. To illustrate, 
we will compare the number of basic operations required to analyze the 
graph in Figure 17(a) using Danielson's algorithm with the number of 
basic operations required to analyze the graph in 17(b) using our algori- 




Using our algorithm on Figure 17(b), we obtain the attainability tree 
shown in Figure 18 (the only one for this graph, since all vertices 
appear in it). 
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Figure 18. The Attainability Tree for the Graph of Figure 17(b) 
We see that this graph requires 22 comparisons of integers. 

Using Danielson's algorithm on the graph in Figure 17(a), we have 
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V V +V V- 
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t,Vj4t^T3*V2T3 v,T3«t2T, iTjT3 



We see that this required 78 basic operations. 
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VI. CONCLUSIONS 



We have developed an algorithm for locating the simple cycles of a 
digraph which appears to be superior to previous algorithms in terms of 
the number of basic operations required and which is conservative in its 
use of memory space on the digital computer. In particular, we have analyzed 
our algorithm and the one originated by Danielson [3], and found that the 
upper bound for the number of basic operations using Danielson's algorithm 
was given by 

D= S(kn‘^“Li) , 
k=2 

while the upper bound on the number of basic operations using our 'algori- 
thm was given by 

kn(n-l) I / (n-k ) ! 

k=l 

k-1 

Since n is greater than (n-1) I / (n-k) , we found that C was less than D. 
Moreover, since the basic operations required by Danielson's algorithm 
are more complex than those required by our algorithm, x<je see that our 
method represents a considerable improvement over the matrix technique. 

By introducing the function p defined in Chapter IV, we were able 
to modify our algorithm for application to coalesced graphs. Thus, we 
can analyze extrem^ily cumbersome graphs using the techniques developed, 
without being required to perform an undue number of basic operations. 

It is our opinion that we have accomplished the goal we set out to 
achieve. However, it is believed that the algorithm might be made even 
more efficient. It was noted after the proof of Theorem II that some 
simple cycles may be located more than once. We suggest that there may 
be a way of eliminating this ambiguity. If this could be done, then the 
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number of basic operations required for certain graphs would be cut down 
considerably. 

It was noted that Danielson's algorithm was designed for undirected 
graphs. Clearly, it is not very efficient. Ue suggest that a method 
similar to the one we have developed could be devised for application 
to undirected graphs which would be superior to Danielson's. 
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